Minimization of the Disagreements in Clustering Aggregation

نویسندگان

  • Safia Nait Bahloul
  • Baroudi Rouba
  • Youssef Amghar
چکیده

Several experiences proved the impact of the choice of the parts of documents selected on the result of the classification and consequently on the number of requests which can answer these clusters. The process of aggregation gives a very natural method of data classification and considers then m produced classifications by them m attributes and tries to produce a classification called "optimal" which is the most close possible of m classifications. The optimization consists in minimizing the number of pairs of objects (u, v) such as a C classification place them in the same cluster whereas another C' classification place them in different clusters. This number corresponds to the concept of disagreements. We propose an approach which exploits the various elements of an XML document participating in various views to give different classifications. These classifications are then aggregated in the only one classification minimizing the number of disagreements. Our approach is divided into two steps: the first consists in applying the K-means algorithm on the collection of XML documents by considering every time a different element from the document. Second step aggregates the various classifications obtained previously to produce the one that minimizes the number of disagreements.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی کاربرد روش فازی (Fuzzy) در طبقه‌بندی خاک‌ها، مطالعه موردی: چشمه سفید کرمانشاه

Chenges in the soil characteristics is rather continuously. A method that takes this continuity into account would present a realistic pattern of soil distribution either in taxonomic or geographical space. The fuzzy set theory provides such an approach. In this study, the robustness of fuzzy clustering in soil pattern recognition was evaluated in a subcatchment of western Iran. The clustering ...

متن کامل

بررسی کاربرد روش فازی (Fuzzy) در طبقه‌بندی خاک‌ها، مطالعه موردی: چشمه سفید کرمانشاه

Chenges in the soil characteristics is rather continuously. A method that takes this continuity into account would present a realistic pattern of soil distribution either in taxonomic or geographical space. The fuzzy set theory provides such an approach. In this study, the robustness of fuzzy clustering in soil pattern recognition was evaluated in a subcatchment of western Iran. The clustering ...

متن کامل

EIDA: An Energy-Intrusion aware Data Aggregation Technique for Wireless Sensor Networks

Energy consumption is considered as a critical issue in wireless sensor networks (WSNs). Batteries of sensor nodes have limited power supply which in turn limits services and applications that can be supported by them. An efcient solution to improve energy consumption and even trafc in WSNs is Data Aggregation (DA) that can reduce the number of transmissions. Two main challenges for DA are: (i)...

متن کامل

درخت تجمیع داده براساس الگوریتم پویای شکل گیری رودخانه درشبکه حسگر بی سیم

One of the main challenges in Wireless Sensor Networks is the limited energy of nodes which can cause to reduce the lifetime of nodes and whole network respectively. Transmissions between the nodes consumes most of the nodes' energy so minimization of unnecessary transmissions can led to reduction of energy consumption. Therefor routing protocols designed based on optimal energy consumption are...

متن کامل

Online Aggregation of Coherent Generators Based on Electrical Parameters of Synchronous Generators

This paper proposes a novel approach for coherent generators online clustering in a large power system following a wide area disturbance. An interconnected power system may become unstable due to severe contingency when it is operated close to the stability boundaries. Hence, the bulk power system controlled islanding is the last resort to prevent catastrophic cascading outages and wide area bl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008